Search CORE

31 research outputs found

Can Genetic Programming Do Manifold Learning Too?

Author: A Cano
A Lensen
B Tran
C Zhang
D François
F Pedregosa
H Liu
IT Jolliffe
JB Kruskal
K Neshatian
L Maaten van der
L Maaten van der
L Rodriguez-Coayahuitl
R Poli
S Nguyen
ST Roweis
Y Bengio
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

Exploratory data analysis is a fundamental aspect of knowledge discovery that aims to find the main characteristics of a dataset. Dimensionality reduction, such as manifold learning, is often used to reduce the number of features in a dataset to a manageable level for human interpretation. Despite this, most manifold learning techniques do not explain anything about the original features nor the true characteristics of a dataset. In this paper, we propose a genetic programming approach to manifold learning called GP-MaL which evolves functional mappings from a high-dimensional space to a lower dimensional space through the use of interpretable trees. We show that GP-MaL is competitive with existing manifold learning algorithms, while producing models that can be interpreted and re-used on unseen data. A number of promising future directions of research are found in the process.Comment: 16 pages, accepted in EuroGP '1

arXiv.org e-Print Archive

Victoria University of Wellington

Crossref

Feature Selection via Chaotic Antlion Optimization

Author: A Gholipour
A Whitney
AE Eiben
B Chakraborty
B Chakraborty
B Raman
B Ren
B Xue
B Xue
CL Huang
Crina Grosan
E. Emary
H Chen
H Kim
H Ming
HH Gao
Hossam M. Zawbaa
I Guyon
IS Oh
J Chuanwen
J Kennedy
JH Holland
JM Aguirregabiria
Josh Bongard
K Neshatian
LY Chuang
M Dash
OA Raouf
R Eberhart
R Kohavi
R Vohra
RO Duda
S Mirjalili
S Saremi
S Shoghian
SM Vieira
T Marill
V Landassuri-Moreno
XS Yang
Y Chen
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2016
Field of study

Selecting a subset of relevant properties from a large set of features that describe a dataset is a challenging machine learning task. In biology, for instance, the advances in the available technologies enable the generation of a very large number of biomarkers that describe the data. Choosing the more informative markers along with performing a high-accuracy classification over the data can be a daunting task, particularly if the data are high dimensional. An often adopted approach is to formulate the feature selection problem as a biobjective optimization problem, with the aim of maximizing the performance of the data analysis model (the quality of the data training fitting) while minimizing the number of features used.This work was partially supported by the IPROCOM Marie Curie initial training network, funded through the People Programme (Marie Curie Actions) of the European Union’s Seventh Framework Programme FP7/2007-2013/ under REA grants agreement No. 316555, and by the Romanian National Authority for Scientific Research, CNDIUEFISCDI, project number PN-II-PT-PCCA-2011-3.2- 0917. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript

Crossref

Directory of Open Access Journals

PubMed Central

Brunel University Research Archive

GEML: A Grammatical Evolution, Machine Learning Approach to Multi-class Classification

Author: A Kattan
A Kattan
A Mojsilović
C Downey
C Ji
DB Fogel
ER Hruschka
F Pedregosa
H Pan
H Steinhaus
K Neshatian
L Breiman
L Muñoz
M Castelli
M Keijzer
M Zhang
MC Cowgill
NS Altman
RC Barros
RE Schapire
RMA Azad
S Belhassen
S Deodhar
TG Dietterich
U Bhowan
U Maulik
UN Raghavan
W Smart
Y Ren
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

In this paper, we propose a hybrid approach to solving multi-class problems which combines evolutionary computation with elements of traditional machine learning. The method, Grammatical Evolution Machine Learning (GEML) adapts machine learning concepts from decision tree learning and clustering methods and integrates these into a Grammatical Evolution framework. We investigate the effectiveness of GEML on several supervised, semi-supervised and unsupervised multi-class problems and demonstrate its competitive performance when compared with several well known machine learning algorithms. The GEML framework evolves human readable solutions which provide an explanation of the logic behind its classification decisions, offering a significant advantage over existing paradigms for unsupervised and semi-supervised learning. In addition we also examine the possibility of improving the performance of the algorithm through the application of several ensemble techniques

Crossref

Birmingham City University Open Access Repository

BCU Open Access

Enhanced uptake and transport of PLGA-modified nanoparticles in cervical cancer

Crossref

A system developed for automatic extraction and categorization of telecommunication literatures for a question answering system

Author: Hejazi M
Neshatian K
Ofoghi Bahadorreza
Publication venue: [The Symposium]
Publication date: 01/01/2003
Field of study

Deakin Research Online

Evolving genetic programming classifiers with loop structures

Author: Abdulhamid F
Neshatian K
Song A
Zhang M
Publication venue: IEEE (United States)
Publication date: 01/01/2012
Field of study

Loop structure is a fundamental flow control in programming languages for repeating certain operations. It is not widely used in Genetic Programming as it introduces extra complexity in the search

RMIT Research Repository

New Representations in PSO for Feature Construction in Classification

Author: A Unler
B Xue
J Kennedy
K Krawiec
K Neshatian
K Neshatian
M Clerc
MA Muharram
Y Marinakis
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

TeLQAS: A telecommunication literature question answering system benefits from a text categorization mechanism

Author: Hejazi M
Jalali A
Mirian M
Neshatian K
Ofoghi Bahadorreza
Publication venue: CSREA Press
Publication date: 01/01/2003
Field of study

Deakin Research Online

Two-Tier genetic programming: towards raw pixel-based image classification

Author: Al-Sahaf H
Neshatian K
Song A
Zhang M
Publication venue: Pergamon (United Kingdom)
Publication date: 01/01/2012
Field of study

Classifying images is of great importance in machine vision and image analysis applications such as object recognition and face detection. Conventional methods build classifiers based on certain types of image features instead of raw pixels because the dimensionality of raw inputs is often too large. Determining an optimal set of features for a particular task is usually the focus of conventional image classification methods. In this study we propose a Genetic Programming (GP) method by which raw images can be directly fed as the classification inputs. It is named as Two-Tier GP as every classifier evolved by it has two tiers, the other for computing features based on raw pixel input, one for making decisions. Relevant features are expected to be self-constructed by GP along the evolutionary process. This method is compared with feature based image classification by GP and another GP method which also aims to automatically extract image features. Four different classification tasks are used in the comparison, and the results show that the highest accuracies are achieved by Two-Tier GP. Further analysis on the evolved solutions reveals that there are genuine features formulated by the evolved solutions which can classify target images accurately

RMIT Research Repository

Extracting image features for classification by two-tier genetic programming

Author: Al-Sahaf H
Neshatian K
Song A
Zhang M
Publication venue: IEEE (United States)
Publication date: 01/01/2012
Field of study

Image classification is a complex but important task especially in the areas of machine vision and image analysis such as remote sensing and face recognition. One of the challenges in image classification is finding an optimal set of features for a particular task because the choice of features has direct impact on the classification performance. However the goodness of a feature is highly problem dependent and often domain knowledge is required. To address these issues we introduce a Genetic Programming (GP) based image classification method, Two-Tier GP, which directly operates on raw pixels rather than features. The first tier in a classifier is for automatically defining features based on raw image input, while the second tier makes decision. Compared to conventional feature based image classification methods, Two-Tier GP achieved better accuracies on a range of different tasks. Furthermore by using the features defined by the first tier of these Two-Tier GP classifiers, conventional classification methods obtained higher accuracies than classifying on manually designed features. Analysis on evolved Two-Tier image classifiers shows that there are genuine features captured in the programs and the mechanism of achieving high accuracy can be revealed. The Two-Tier GP method has clear advantages in image classification, such as high accuracy, good interpretability and the removal of explicit feature extraction process

Victoria University of Wellington

RMIT Research Repository